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Abstract 

The palindromization map in a free monoid A* was introduced in 1997 by the first author in the 
case of a binary alphabet A, and later extended by other authors to arbitrary alphabets. Acting on 
infinite words, (/r generates the class of standard episturmian words, including standard Arnoux- 
Rauzy words. In this paper we generalize the palindromization map, starting with a given code 
X over A. The new map (Ax maps X* to the set PAL of palindromes of A*. In this way some 
properties of i/r are lost and some are saved in a weak form. When X has a finite deciphering 
delay one can extend to X''\ generating a class of infinite words much wider than standard 
episturmian words. For a finite and maximal code X over A, we give a suitable generalization of 
standard Arnoux-Rauzy words, called X-AR words. We prove that any X-AR word is a morphic 
image of a standard Arnoux-Rauzy word and we determine some suitable linear lower and upper 
bounds to its factor complexity. 

For any code X we say that ij/x is conservative when ijjxiX*) c X* . We study conservative 
maps ij/x and conditions on X assuring that ij/x is conservative. We also investigate the special 
case of morphic-conservative maps ij/x, i.e., maps such that (pot// = if/x o tp for an injective 
morphism (p. Finally, we generalize i/'x by replacing palindromic closure with )?-palindromic 
closure, where j? is any involutory antimorphism of A*. This yields an extension of the class of 
??-standard words introduced by the authors in 2006. 

Keywords: Palindromic closure, Episturmian words, Arnoux-Rauzy words. Generalized 
palindromization map, Pseudopalindromes 
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1. Introduction 

A simple method of constructing all standard Sturmian words was introduced by the first 
author in 1 1]. It is based on an operator definable in any free monoid A* and called right palin- 
dromic closure, which maps each word w 6 A* into the shortest palindrome of A* having w as a 
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prefix. Any given word v E A* can suitably 'direct' subsequent iterations of the preceding oper- 
ator according to the sequence of letters in v, as follows: at each step, one concatenates the next 
letter of v to the right of the already constructed palindrome and then takes the right palindromic 
closure. Thus, starting with any directive word v, one generates a palindrome i/'(v). The map tfr, 
called palindromization map, is injective; the word v is called the directive word of iA(v). 

Since for any u,v e A*, \l/{uv) has ij/iu) as a prefix, one can extend the map ijj to right infinite 
words X e A" producing an infinite word (/'(x). It has been proved in f7| that if each letter of a 
binary alphabet A occurs infinitely often in x, then one can generate all standard Sturmian words. 

The palindromization map \l/ has been extended to infinite words over an arbitrary alphabet 
A by X. Droubay, J. Justin, and G. Pirillo in [2], where the family of standard episturmian words 
over A has been introduced. In the case that each letter of A occurs infinitely often in the directive 
word, one obtains the class of standard Arnoux-Rauzy words A standard Arnoux-Rauzy 

word over a binary alphabet is a standard Sturmian word. 

Some generalizations of the palindromization map have been given. In particular, in |5] a 
i?-palindromization map, where d- is any involutory antimorphism of a free monoid, has been 
introduced. By acting with this operator on any infinite word one obtains a class of words larger 
than the class of standard episturmian, called )?-standard words; when § is the reversal operator 
one obtains the class of standard episturmian words. Moreover, the palindromization map has 
been recently extended to the case of the free group F2 by C. Kassel and C. Reutenauer in |l6t]. A 
recent survey on palindromization map and its generalizations is in |01 • 

In this paper we introduce a natural generalization of the palindromization map which is con- 
siderably more powerful than the map ^ since it allows to generate a class of infinite words much 
wider than standard episturmian words. The generalization is obtained by replacing the alphabet 
A with a code X over A and then 'directing' the successive applications of the right-palindromic 
closure operator by a sequence of words of the code X. Since any non-empty element of X* can 
be uniquely factorized by the words of X, one can uniquely map any word of X* to a palindrome. 
In this way it is possible associate to every code X over A a generalized palindromization map 
denoted by if/x- \fX-A one reobtains the usual palindromization map. 

General properties of the map ij/x are considered in Section[3] Some properties satisfied by ij/ 
are lost and others are saved in a weak form. In general ij/x is not injective; if X is a prefix code, 
then ^x is injective. Moreover, for any code X,w e X*, and x eX one has that ijjxiw) is a prefix 
oiif/xiwx). 

In Section m the generalized palindromization map is extended to infinite words of X". In 
order to define a map ^x ■ — > A" one needs that the code X has a finite deciphering delay, 
i.e., any word of X'^ can be uniquely factorized in terms of the elements of X. For any t e X" 
the word s - 4'x{t) is trivially closed under reversal, i.e., if m is a factor of s, then so will be 
its reversal u~ . If X is a prefix code, the map ij/x ■ X'^ — > A'^ is injective. Moreover, one can 
prove that if X is a finite code having a finite deciphering delay, then for any t e X'^ the word 
^x{t) is uniformly recurrent. We show that one can generate all standard Sturmian words by the 
palindromization map ^x with X - A^. Furthermore, one can also construct the Thue-Morse 
word by using the generalized palindromization map relative to a suitable infinite code. 

In Section |5] we consider the case of a map ^x ■ X" A" in the hypothesis that X is a 
maximal finite code. From a basic theorem of Schiitzenberger the code X must have a deciphering 
delay equal to 0, i.e., X has to be a maximal prefix code. Given y = xi ■ ■ • jc, ■ • ■ e X'^ with x, e X, 
i > 1, we say that the word s - ilfxiy) is a generalized Arnoux-Rauzy word relative to X, briefly 
X-AR word, if for any word x e X there exist infinitely many integers / such that x = x, . IfX-A 
one obtains the usual definition of standard Arnoux-Rauzy word. 
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Some properties of the generalized Arnoux-Rauzy words are proved. In particular, any X-AR 
word s is w-power free, i.e., any non-empty factor of s has a power which is not a factor of s. We 
prove that the number S r(n) of right special factors of s of length n for a sufficiently large n has 
the lower bound given by the number of proper prefixes of X, i.e., (card(X) - l)/{d - 1), where 
d = card(A). From this one obtains that for a sufficiently large n, the factor complexity Ps(n) has 
the lower bound (card(Z) - l)n + c, with c € Z. Moreover, we prove that for all n, Psin) has 
the linear upper bound 2 card(X)n + b with b e Z. The proof of this latter result is based on a 
theorem which gives a suitable generalization of a formula of Justin ifisl . A further consequence 
of this theorem is that any X-AR word is a morphic image of a standard Arnoux-Rauzy word on 
an alphabet of card(X) letters. An interesting property showing that any X-AR word s belongs to 
X'^ is proved in Section|6] 

In Section |6] we consider a palindromization map if/x satisfying the condition ipxiX*) Q X*. 
We say that iffx is conservative. Some general properties of conservative maps are studied and a 
sufficient condition on X assuring that t//x is conservative is given. A special case of conservative 
map is the following: let (p : A* — > B* be an injective morphism such that (f(A) = X. The map 
iffx is called morphic-conservative if for all w E A*, (p{iff(w)) = t//x(<p(w)). We prove that if iffx is 
morphic-conservative, then X c PAL, where PAL is the set of palindromes, and X has to be a 
bifix code. This implies that ifrx is injective. Moreover one has that i/zx is morphic-conservative 
if and only if X c PAL, X is prefix, and ifrx is conservative. Any morphic-conservative map 
ifrx can be extended to X" and the infinite words which are generated are images by an injective 
morphism of epistandard words. An interesting generalization of conservative map to the case 
of infinite words is the following: a map t//x, with X a code having a finite deciphering delay, 
is weakly conservative if for any t e X'^, 4'x{t) e X". If ij/x is conservative, then it is trivially 
weakly conservative, whereas the converse is not in general true. We prove that if X is a finite 
maximal code, then ^x is weakly conservative. 

In Section|2]we give an extension of the generalized palindromization map i/'x by replacing 
the palindromic closure operator with the )?-palindromic closure operator, where § is an arbitrary 
involutory antimorphism in A*. In this way one can define a generalized i?-palindromization map 
^{>,x ■ X* — > PAL,t, where PAL§ is the set of fixed points of i? ()?-palindromes). If X is a code 
having a finite deciphering delay one can extend ifi^ x to X" obtaining a class of infinite words 
larger than the ??-standard words introduced in |5]. We limit ourselves to proving a noteworthy 
theorem showing that t//§ - ^,joij/ - ij/,<f x ° y"i? where X - fiitiA) and ju^ is the injective morphism 
defined for any a e A as ju,y(fl) = a if a = §(a) and ii§(a) - a§{a), otherwise. 

2. Notation and preliminaries 

Let A be a non-empty finite set, or alphabet. In the following. A* will denote the/ree monoid 
generated by A. The elements of A are called letters and those of A* words. The identity element 
of A* is called empty word and it is denoted by e. We shall set A^ = A* \ [e]. A word w e A^ can 
be written uniquely as a product of letters w - a\a2 ■ ■ - an, with a, e A, / = 1, . . . , n. The integer 
n is called the length of w and is denoted by Ivv]. The length of e is conventionally 0. 

Let w e A*. A word v is a. factor of w if there exist words r and s such that w - rvs. A factor 
V of w is proper if v w. If r = e (resp. s - s), then v is called a prefix (resp. suffix) of w. If v 
is a prefix (resp. suffix) of w, then v^'w (resp. wv"') denotes the word u such that vu - w (resp. 
uv - w). If V is a prefix of w we shall write v < w and, if v w, v < w. 

A word w is called primitive if w ^ v", for all v 6 A* and « > 1. We let PRIM denote the set 
of all primitive words of A*. 
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The reversal of a word w - a\a2 - ■ ■ a„, with a, 6 A, 1 < i < n, is the word w~ - a„ - ■ - ax. 
One sets e~ = e. A palindrome is a word which equals its reversal. The set of all palindromes 
over A will be denoted by PAL{A), or PAL when no confusion arises. For any X c A* we set 
X~ - {x~ I X e X]. For any word w e A* we let LPS{w) denote the longest palindromic suffix of 
w. For X c A*, we set LPS {X) - {LPS (x) \ x e X}. A word w is said to be rich in palindromes, 
or simply rich, if it has the maximal possible number of distinct palindromic factors, namely 
|w| + 1 (cf. 0). 

A right infinite word, or simply infinite word, w is just an infinite sequence of letters: 

w = a\a2 ■ ■ - an - ■ ■ , where a, e A, for all / > 1 . 

For any integer n > 0, W[„] will denote the prefix a\a2 - ■ - an of w of length n. A factor of w is 
either the empty word or any sequence a; • ■ • cij with / < j. If w = uvvv ■ ■ ■ y ■ ■ ■ = uv" with u e A* 
and V € A^, then w is called ultimately periodic and periodic if m = e. 

The set of all infinite words over A is denoted by A'^. We also set A°° - A* U A". For any 
w e A°° we denote respectively by Factvv and Pref w the sets of all factors and prefixes of the 
word w. For X c A*, Pref X denotes the set of all prefixes of the words of X. 

Let w e A°°. A factor m of w is right special (resp. left special) if there exist two letters 
a,b & A, a + b, such that ua and ub (resp. au and bu) are factors of w. The factor m is called 
bispecial if it is right and left special. The order of a right (resp. left) special factor m of w is the 
number of distinct letters a € A such that ua e Fact w (resp. au e Fact w). 

Let w 6 A°° and u a factor of w. An occurrence of m in w is any A e A* such that /Im < w. If 
Ai and A2 are two distinct occurrences of m in w with \Ai \ < I/I2I, the gap between the occurrences 
is \A2\ - \A\\. For any w e A* and letter a e A, \w\a denotes the number of occurrences of the letter 
a in w. 

The factor complexity p„ of a word w E A°° is the map p^ : N — > N counting for each « > 
the distinct factors of w of length n, i.e., 

p„,(n) = card(A" n Factw). 

The following recursive formula (see, for instance, |8]) allows one to compute the factor com- 
plexity in terms of right special factors: for all n > 

d 

p„(n + 1) = pAn) + ^(; - \)sr(f n), (1) 

where d - card(A), and s, (7, n) is the number of right special factors of w of length n and order 
J- 

A morphism (resp. antimorphism) from A* to the free monoid B* is any map : A* — > B* 
such that (p(uv) - (p(u)(f(v) (resp. (p(uv) - (p{v)(p{u)) for all u,v e A*. A morphism (p can be 
naturally extended to A^ by setting for any w - aia2 ■ ■ ■ a„ ■ ■ ■ e A"^, 

(p(w) = (p(ai)(p(a2) ■ ■ ■ (p(a„) ■■■ . 

A code over A is a subset X of A^ such that every word of X'^ admits a unique factorization 
by the elements of X (cf. |9]). A subset of A^ with the property that none of its elements is a 
proper prefix (resp. suffix) of any other is trivially a code, usually called prefix (resp. suffix). We 
recall that if X is a prefix (resp. suffix) code, then X* is right unitary (resp. left unitary), i.e., for 
all p €X* and w g A*, pw e X* (resp. wp e X*) implies w g X*. 
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A bifix code is a code which is both prefix and suflix. A code X is called infix if no word of X 
is a proper factor of another word of X. A code X will be called weakly overlap-free if no word 
X e X can be factorized as x = sp where s and p are respectively a proper non-empty suffix of a 
word x' e X and a proper non-empty prefix of a word x" e X. Note that the code X = {o/jZ?, Zj/jc) 
is not overlap-free | lOJ, but it is weakly overlap free. 

A code X has a finite deciphering delay if there exists an integer k such that for all x, x' e X, 
if xX^A* n x'X* + then x - x' . The minimal k for which the preceding condition is satisfied is 
called deciphering delay of X. A prefix code has a deciphering delay equal to 0. 

Let X be a set of words over A. We let X" denote the set of all infinite words 

X — x\X2 - ■ ■ Xn - ■ ■ , with Xi e X, i > I. 

As is well known [9J, if X is a code having a finite deciphering delay, then any x e X" can be 
uniquely factorized by the elements of X. 

2.1. The palindromization map 

We introduce in A* the map ^-^^ : A* —> PAL which associates to any word w e A* the 
palindrome w*^* defined as the shortest palindrome having the prefix w (cf. fV\). We call w*^* the 
right palindromic closure of w. If Q - LPS (w) is the longest palindromic suffix of w = uQ, then 
one has 

= uQu~ . 

Let us now define the map 

iJ/iA*^ PAL, 

called right iterated palindromic closure, or simply palindromization map, over A*, as follows: 
ifr{s) - s and for all u 6 A*, a e A, 

i//(ua) — (i//(u)a)^'^^ . 

The following proposition summarizes some simple but noteworthy properties of the paUn- 
dromization map (cf., for instance, HI HI]): 

Proposition 2.1. The palindromization map i// over A* satisfies the following properties: for 
u, V e A* 

PI. If u is a prefix of v, then >J/(u) is a palindromic prefix (and suffix) of>j/{y). 
P2. If p is a prefix ofi//(v), then p^^^ is a prefix ofi//(v). 

P3. Every palindromic prefix oftff(v) is of the form tff(u)for some prefix u of v. 

P4. The palindromization map is injective. 

For any w e iff(A*) the unique word u such that iff(u) = w is called the directive word of w. 
One can extend i// to A'^ as follows: let w e A'^ be an infinite word 

w — a\a2 ■ ■ ■ a„ - ■ ■ , a,- E A, / > L 

Since by property PI of the preceding proposition for all n, i^(w[„]) is a prefix of iA^(w[„+i]), we 
can define the infinite word \l/{w) as: 



il/(w) = lim i/'(w[„]). 

H— >oo 
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The extended map i// : A" A" is injective. The word w is called the directive word of i//(w). 

The family of infinite words il/{A'^) is the class of the standard episturmian words, or simply 
epistandard words, over A introduced in ||2l(see also 111]). When each letter of A occurs in- 



finitely often in the directive word, one has the class of the standard Arnoux-Rauzy words ([slQ]. 
A standard Arnoux-Rauzy word over a binary alphabet is usually called standard Sturmian word. 
EpistandA will denote the class of all epistandard words over A. 

An infinite word s e A'^ is called episturmian (resp. Sturmian) if there exists a standard 
episturmian (resp. Sturmian) word t e A'^ such that Fact s = Fact t. 

The words of the set if/{A*) are the palindromic prefixes of all standard episturmian words 
over the alphabet A. They are called epicentral words, and simply central flT], in the case of a 
two-letter alphabet. 

Example 2.1. Let A - {a, b]. If w = (ab)", then the standard Sturmian word / = i//{{ab)") having 
the directive word w is the famous Fibonacci word 

f = abaababaabaab ■ ■ • 

In the case of a three letter alphabet A — {a, b, c] the standard Arnoux-Rauzy word having the 
directive word w = (abc)" is the so-called Tribonacci word 

T — abacabaabacaba ■ ■ ■ . 



3. A generalized palindromization map 

Let X be a code over the alphabet A. Any word w e can be uniquely factorized in terms 
of the elements of X. So we can introduce the map 

lAx : ^* ^ PAL, 

inductively defined for any w eX* and x e X as: 

lAx(e) = e, i^x(x) = x'-^\ 

(Ax(wx) = (iAx(w).x)<+'. 

In this way to each word w 6 X*, one can uniquely associate the palindrome tf/xiw). We call tf/x 
the palindromization map relative to the code X. \iX - A, then tj/^ — tf'- 

Example 3.1. Let A - {a, b], X - {ab, ba], and w = abbaab; X is a code so that w can be uniquely 
factorized as w = xiX2Xi with xi — ab and X2 — ba. One has: tffx(ab) — aba, tffx(abba) — 
(ababa)*^^^ — ababa, and ipxi^bbaab) — ababaababa. 

The properties of the palindromization map i/r stated in Proposition 12.11 are not in general 
satisfied by the generalized palindromization map i/r^. For instance, take X - [ab, abb} one has 
ab < abb but ifrxiab) = aba is not a prefix of if/x{obb) = abba. Property PI can be replaced by 
the following: 

Proposition 3.1. Let v - x\ ■ ■ ■ x„ with Xi € X, i — 1, . . . ,n. For any vj - x\ ■ ■ ■ Xj, 1 < j < n one 

has ipxivj) < f^xCv). IfX is a prefix code, then the following holds: for u,v e X* if u < v, then 
ipx(u) < lAx(v). 
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Proof. For any j - I, . . . ,n - I one has 

IpxiXl ■ --XjXj+l) = (ipxiXl ■ ■ ■ Xj)Xj+l)'-'^\ 

so that il/x{x\ ■ ■ ■ Xj) < 4'x{x\ ■ ■ ■ xj+i). From the transitivity of relation < it follows ^xivj) < 
i/'xCv). Let now X be a prefix code and suppose that u,v eX* and u <v. We can write v = xi ■ ■ ■ x„ 
and M = x'j • • • x^, with x,, x^. e X, / = 1 , . . . , n and j - \, . . . ,m. Since u < v, one has v - with 
f e A*. From the right unitarity of X* it follows ( e X* and, therefore, x' = x, for / = 1, . . . , m. 
From the preceding result it follows that ifrxiu) i lAx(v). □ 

Properties P2 and P3 are also in general not satisfied by i^x- As regards P2, consider, for 
instance, the code X = {a, ab, bb] and the word w - abbab. One has ijJxiyv) - abbaabba. Now 
4ix{w) has the prefix ab but not {ab)^^^ = afoa. As regards P3 take X = {abab, b] one has that 
if/x{abab) - ababa. Its palindromic prefix aba is not equal to 4'x(y) for any v &X* . 

Differently from i/', the map ij/x is not in general injective. For instance, if X is the code 
X - {ab, aba], then iffx(ab) - if/xiaba) = aba. Property P4 can be replaced by the following: 

Proposition 3.2. Let X be a prefix code over A. Then tffx is injective. 

Proof. Suppose that there exist words xi , . . . , x„,, x'j , . . . x^ e X such that 

ij^xixi ■ ■ ■ x„) = i^xix'i ■ ■ ■ x'J. 

We shall prove that m = n and that for all 1 < i < n, one has x, - x'.. 

Without loss of generality, we can suppose m < n. Let us first prove by induction that for all 
1 < / < m, one has x,- - x'.. Let us assume that xi = x'j, . . . , x^, = x^ for Q < k < m and show 
that Xk+\ - x^,^j . To this end let us set w - t//x(xi ■ ■ ■ x^) and w' = ^x(x[ ■ ■ ■ x,'„). In view of the 
preceding proposition, we can write: 

w = i^xixi ■ ■ ■ XkXk+i)^ = (Mxi ■ ■ ■ xt)xt+i)<+'^ 

and 

w' = ifrxixi ■ ■■Xkx[_^^)^' = {t//x(xi ■ ■ ■xt)xj_^i)*+'^', 
with ^' 6 A*. Now one has: 

(l^xiXl ■ ■ ■ Xk)Xk+l)'-'^'' = i^x(xi ■ ■ ■ Xk)xui^ 

and 

{tjJxixi ■ ■ ■xt)x^^i)*+' = il/xixi ■ ■ ■ Xk)x[^^^', 
with ^, e A*. Therefore, we obtain: 

W = ll/x{Xl ■ ■■Xk)xui^( = l/rx(Xl ■ ■ ■Xt)Xj_|_i^'^' = w'. 

By cancelling on the left in both the sides of previous equation the common prefix ^x{x\ ■ ■ ■ xt) 
one derives 

Xk+x^( = 4+1^'^'- (2) 
Since X is a prefix code one obtains Xk+\ - x^^j. Since an equation similar to (|2]i holds also in 
the case k-0 one has also xi = x'j . Therefore, x, — xJ for ; = 1 , . . . , m. We can write: 

il/x{x\ ■ ■ ■ X,n) = il/x{x\ ■ ■ ■ X,„x'^_^^ ■ ■ ■ x'„). 

Since by Proposition l3.1l iffx(x\ ■ ■ ■ x„,) < if/x(x\ ■ ■ ■ x,nx'^^^ ' " x'„) it follows that m-n. □ 
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A partial converse of the preceding proposition is: 

Proposition 3.3. Let X be a code such that X c PAL n PRIM. Ifipx is injective, then X is prefix. 

Proof. Let us suppose that X is not a prefix code. Then there exist words x,y eX such that x i^y 
and y - xA with A e A^ . Since x,y e PAL one has y — xA — A~x. We shall prove that the longest 
palindromic suffix LPS (yyx) of the word yyx - A~xyx is xyx. This would imply, as jc, y e PAL, 
that 

'I'xiyyx) = iyyxf^'' = A~xyxA = yyy = (yyy)*^' = ^xiyyy), 

so that i/^x would be not injective, a contradiction. 

Let us then suppose thaty = A~x - ax/3, a,/3 E A*, and that LPS (yyx) = x/3yx. This imphes 
/3y e PAL, so that, /3y = ySffXyS = yy6~ = ax/3/3~. Therefore, one has /3 - /3~ and 

j8(axj8) = (axjS)jS. 



From a classic result of combinatorics on words II13I1 . there exist w e PRIM and integers E N 



such that j0 = and y = ax/3 - vJ^ . Since y G PRIM, it follows that A; = 1, y = w, and /3 - y^ . 
As |y6| < lyl, the only possibility is h - 0, so that [3 - e, which implies LPS (yyx) - xyx. □ 



4. An extension to infinite words 

Let us now consider a code X having a finite deciphering delay. One can extend if/x to X"^ 
as follows: let x - xiX2 ■ ■ ■ x„ - ■ ■ , with jc, e X,i > 1. From Proposition 13. II for any n > \, 
4'x(x\ ■ ■ • Xn) is a proper prefix of 4'x(m ■ ■ ■ x„x„+\) so that there exists 

hm \l/x(xi ■ ■ ■ x„) = iIjx(x). 

Let us observe that the word 4'x(x) has infinitely many palindromic prefixes. This implies that 
\lfx(x) is closed under reversal, i.e., if w G ¥?lc1^x(x), then also w~ £ Facti/'xCx). If Z = A one 
obtains the usual extension of i/r to the infinite words. 

Let us explicitly remark that if X is a code with an infinite deciphering delay one cannot 
associate by the generalized palindromization map to each word x e X" a unique infinite word. 
For instance, the code X - [a, ab, bb] has an infinite deciphering delay; the word ab'^ admits two 
distinct factorizations by the elements of X. The first beginning with ab is (ab)(bb)'^, the second 
beginning with a is a(bb)'^. Using the first decomposition one can generate by the generalized 
palindromization map the infinite word (ababb)'^ and using the second the infinite word (abb)'^. 

Let us observe that the previously defined map t//x : X'^ A" is not in general injective. 
For instance, take the code X = {ab, aba} which has finite deciphering delay equal to 1. As it is 
readily verified one has t//x((ab)'^) = t//x((aba)") - (aba)". 

The following proposition holds; we omit its proof, which is very similar to that of Proposi- 
tion [321 

Proposition 4.1. Let X be a prefix code over A. Then the map tffx ■ X" — > A'^ is injective. 

The class of infinite words that one can generate by means of generalized palindromization 
maps i/'x is, in general, strictly larger than the class of standard episturmian words. 
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Example 4. 1. Let A - {a, b] and X - {a, bb}. Let x be any infinite word x - abbay with y e X'^. 
One has that ij/xiabba) - abbaabba, so that the word 4'x{x) will not be balanced (cf. IIIZI ). This 
implies that is not a Sturmian word. Let A - {a, b, c] and X = {a, abca). Take any word 

X - abcay with y e X". One has tfrxiobca) - abcacba. Since the prefix abca is not rich in 
palindromes, it follows that 4>x{x) is not an episturmian word. 

Theorem 4.2. For any finite code X having finite deciphering delay and any t e X", the word 
s — ipxit) is uniformly recurrent. 

Proof. Let f = x\X2 - ■ ■ Xn - ■ ■ e X'^, with x, e X, i > 1, and w be any factor of s. Let a be the 
shortest prefix a - x\ ■ ■ ■ Xh of t such that w e FactM, with u - i/'x(Q'). The word s is trivially 
recurrent since it has infinitely many palindromic prefixes. Hence, w occurs infinitely many times 
in s. We will show that the gaps between successive occurrences of w in i are bounded above by 
\u\ + 2(x, where ix - maXjez kl- This is certainly true within the prefix u: even if w occurs in u 
more than once, the gap between any two such occurrences cannot be longer than \u\. 

Let us then assume we proved such bound on gaps for successive occurrences of w in i//x(fi), 
where j3 - x\ ■ ■ ■ Xk, h < k, and let us prove it for occurrences in if/xifiy), where y - x^+i. We can 
write i/'xOS) - up - p~u and tfrx(fiy) - if/xiP)^ - /l~(/'x06) for some A,p e A*, so that 

^xifiy) ^ p~uA^ A~up . (3) 

By inductive hypothesis, the only gap we still need to consider is the one between the last oc- 
currence of w in p~u and the first one in up as displayed in If \p\ > \A\, then both such 
occurrences of w fall within p~u = i^xifi), so that by induction we are done. So suppose \A\ > \p\. 
As one easily verifies, the previous gap is at most equal to the gap between the two displayed 
occurrences of u in Q, namely \A\ - \p\. From Q one has: 

- IpI = l«AxO0y)| - Wxim - Qif^xm - \u\) = \if,x(J3y)\ - 2|iAx03)| + \u\. 

Now, as 

l-AxOSy)! = Ux(J3)yY*'>\ < 2(1(^x08)1 + < 2\,j,xm + 2^x , 

we have \A\ - \p\ < \u\ + 2{x- By induction, we can conclude that gaps between successive 
occurrences of w are bounded by \u\ + 2{x in the whole s, as desired. □ 

Let y - yiyi - ■ 'yn' ■ ■ 6 X'^, with e X for all / > 1 . We say that a word x e X is persistent 
in y if there exist infinitely many integers /] < i2 < ■ ■ ■ < ik < ■ ■ ■ such that x - y,-, for all A: > L 

We say that the word v = yiyi ■ • 'Jn • ■ • 6 X" is alternating if there exist distinct letters 
a,b e A, a word A e A*, and a sequence of indices io < ii <■■■</„<■■■, such that Aa < yi^^ 
and Ab < y,,,^, for all k>Q. 

We remark that if there exist two distinct words xi,X2 6 X, which are persistent in y and 
such that {xi, JC2} is a prefix code, then y is alternating. If X is finite, then the two conditions are 
actually equivalent. 

Proposition 4.3. Let y = yi ■ ■ -y^ ■ ■ ■ e X" with yi e X, i > I. Ify is alternating, then ^xiy) is 
not ultimately periodic. 

Proof. By hypothesis, there exists an increasing sequence of indices (/„)„>o, such that for all 
A: > we have Aa < yi^^ and Ab < y,,;,^, , for some A e A* and letters a b. 
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For all n > 0, let u„ denote the word ij/xiyi ■ ■ -yn)- We shall prove that u„A is a right special 
factor of s - ifxiy) for any n, thus showing that s cannot be ultimately periodic (cf. 

We can choose an integer h > satisfying i2h > n. Let us set m = i2h and xi = y/j^. Now one 
has that: 

Mm-l-^l i Mm 6 Prcf S. 

Since m„ is a prefix and a suffix of m,„_i it follows, writing xi = /la?/ for some rj e A*, that 

UnXi — u„Aarj e Fact s. 

Since i2/,+i > !2/i, setting X2 = ya/i+i = -i^??' for some //' E A*, one derives by a similar argument 
that: 

Ui,X2 — UijAbrj' € Fact s. 

From the preceding equations one has that u,,! is a right special factor of s. □ 

We shall now prove a theorem showing how one can generate all standard Sturmian words 
by the palindromization map relative to the code X = {a, b}^. We premise the following lemma 
which is essentially a restatement of a well known characterization of central words (see for 
instance 1 1, Proposition 9]). 

Lemma 4.4. Let A — [a, b} and E be the automorphism of A* interchanging the letter a with b. 
If z e A and w & A* \ z*, then 

tffiwz) — i//(w)zE(z)tlf(w') for some w' e Pref w. 

Theorem 4.5. Let A — {a,b\ and X — A^. An infinite word s € A" is standard Sturmian if and 
only if s — t//xit)for some alternating t 6 X'^ such that 

t 6 ((flfl)* U {bby){ab,bar . 

Proof. Let s = tf/xit); we can assume without loss of generality that t e (aa)''{ab, ba]" with 
k €N. Let t[2n] be the prefix of f of length 2n (which belongs to X*). We shall prove that iffx(t[2n]) 
is a central word for all n > 0. This is trivial for all prefixes t[2p] of f with p < k. Let us now 
assume, by induction, that i/'x(f[2n]) is central for a given n > k and prove that i^x(f[2n+2]) is 
central. 

We can write f[2„+2] = t[2n]ab or f[2«+2] = t[2n]ba. Since by the inductive hypothesis i/'x(f[2n]) 
is central, there exists u„ e A* such that i/'x(f[2«]) - The words iJ/x{t[2n])ob and 4'x{t[2n])ba 

are finite standard words and therefore, as is well known, prefixes of standard Sturmian words 
(cf. il2i Corollary 2.2.28]). By property P2 of Proposition 12.11 their palindromic closures 
(i/'x(f[2«])fl^)*^* = iAA'(f[2«]fl^) and (iAx(r[2«])^fl)*^* = ^x{t[in]ba) are both central. Hence, in any 
case 4'x{t['2.n+2]) is central so that there exists u„+i e A* such that ij/xih^n+i]) - iA(m«+i)- Since 
(/'(«„) is a prefix of i/'(m„+i) from Proposition l2.1l one derives that m„ < m„+i. 

We have thus proved the existence of a sequence of finite words (m„)„>o, with m, < Ui+\ for all 
i > 0, such that for all n > we have 

l/'z(f[2«]) = ^{u„) ■ 

Letting A = lim„^oo u„, we obtain s - i/'(A). Since f is alternating, s is not ultimately periodic by 
Proposition [43] so that it is a standard Sturmian word. 
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Conversely, let s be a standard Sturmian word, and let A be its directive word. Without loss 
of generality, we can assume that A begins in a; let n > 1 be such that a"b e Pref A. If n is even, 
we have 

il/{a"b) = ^(00)2/7^ — (^(aa)'^baj — ipx {{aa)'-ba^ 

whereas if n is odd, 

i//{a"b) — (^(aa)^ ab^ — if)x{(oa)~ ab^ . 

Let now z e A and uz be a prefix of A longer than a"b. By induction, we can suppose that there 
exists some w € {aay{ab,ba}* such that iff(u) - tf/xiw). From Lemma l4!4l and Proposition 12. II 
we obtain, setting z - E(z), {^{u)zz)^^^ < t//iuz) < (i/'(m)zz)^^'. Hence, 

iA(mz) = (iA(m)zz)^^' = (iAx(w)zz)(+' = lAx(wzl) . (4) 

We have thus shown how to construct arbitrarily long prefixes of the desired infinite word f, 
starting from the Sturmian word s. Since a and b both occur infinitely often in A, by (HJi we 
derive that t is alternating. □ 

Example 4.2. In the case of Fibonacci word / let us take X = {ab, bo). As it is readily verified, 
one has: 

/ = (jjxiabiabbaT). 



Let yU be the Thue-Morse morphism, and t - fJ.'^(a) the Thue-Morse word flS*]. We recall that 
ju is defined by fi{a) - ab and fj.(b) = ba. The next proposition will show that t can be obtained 
using our generalized palindromization map, relative to a suitable infinite code. 

Let us set u„ = fp-"{a) and v„ - E{un)b, for all « e N. Thus vq = bb, v\ - baabb, V2 - 
baababbaabbabaabb, and so on. 

Proposition 4.6. The set X — {a} U {v„ \ n e N} is a prefix code, and 

t = l/rx(flV()VlV2---) . 

Proof. As a consequence of fsl Theorem 8.1], we can write m„+i - fi^"^^(a) = (fi^"^^ {a)b'j \ 
Since for any A: > one has //'^^'(o) = i/{a)E (f/(afj, we obtain for all « > 

Un+l = (UnE(Un)bY^'' = (m«V„)*^* . (5) 

Since b < v, for all / > 0, by (|5]l it follows Ujb < UiVi < m,+i, so that Uib < uj whenever < / < j, 
whence E(uib) = E(ui)a < Eiuj). This implies that for < / < j, v, = E(ui)b is not a prefix of 
Vj = E(uj)b. Clearly v, is not a prefix of any with k < i, nor of a, which in turn is not a prefix 
of any v, with / e N; hence X is a prefix code. 

Since uq = a - t//xia), from ^ it follows that for all n > 0, m„ = i^x(av'o ■ ■ ■ v„_i). As 
f - lim„^oo M«, the assertion is proved. □ 
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5. Generalized Amoux-Rauzy words 

Let us suppose that the code X over the alphabet A is finite and maximal, i.e., it is not properly 
included in any other code on the same alphabet. By a classic result of Schiitzenberger either 
X is prefix or has an infinite deciphering delay i^. Therefore, if one wants to define a map 
(//x '■ X'^ —> A" one has to suppose that the code is a prefix maximal code. 

We shall now introduce a class of infinite words which are a natural generalization in our 
framework of the standard Arnoux-Rauzy words. 

Let X be a finite maximal prefix code over the alphabet A of cardinality d > \. We say that 
the word s - ^xiy), with y e X" is a standard Arnoux-Rauzy word relative to X, or X-AR word 
for short, if every word x € Xis persistent in y. 

Let us observe that if X = A we have the usual definition of standard Arnoux-Rauzy word. 
Any X- AR word is trivially alternating and therefore, from Proposition 14.31 it is not ultimately 
periodic. The following proposition extends to X-AR words a property satisfied by the classic 
standard Arnoux-Rauzy words. 

Proposition 5.1. Let s = i/'xCv) be an XAR word with y - yi ■ ■ -yn ■ ■ ■ , yi e X, i > 1. Then for 
any n > 0, u„ — ipxiyi ■ ■ "Jn) is a bispecial factor of s of order d — card(A). This implies that 
every prefix of s is a left special factor of s of order d. 

Proof. Since X is a finite maximal prefix code, it is complete i.e., it is represented by the 
leaves of a full li-ary tree (i.e., each node in the tree is either a leaf or has exactly degree d). 
Hence, X^ - A, where X^ denotes the set formed by the first letter of all words of X. Any word 
X e X is persistent in y, so that, by using an argument similar to that of the proof of Proposition 
14.31 one has that for any n > 0, 

u„X c Fact s, 

that implies m„X^ - u„A c Fact s, i.e., u„ is a right special factor of s of order d. Since s is closed 
under reversal and u„ is a palindrome, one has that m„ is also a left special factor of s of order d. 
Hence, u„ is a bispecial factor of order d. Let m be a prefix of s. There exists an integer n such 
that u < u„. From this one has that m is a left special factor of s of order d. □ 

An infinite word s over the alphabet A is co-power free if for every non-empty word u e Fact s 
there exists an integer p > such that u^ i Fact s. We recall the following result (see, for 
instance. Lemma 2.6.2]) which will be useful in the sequel: 

Lemma 5.2. A uniformly recurrent word is either periodic or oj-power free. 

Corollary 5.3. An X-AR word is co-power free. 

Proof. An X-AR word is not periodic and by Theorem l4.2l it is uniformly recurrent, so that the 
result follows from the preceding lemma. □ 

Lemma 5.4. Let X c A* be a finite set and set I = Ix - max{|x| | x e X\. Let w = wi • • • Wm, 
W; E A, / = 1, . . . , m, be a palindrome with m > (. If there exist u,v € (Pref X) \ X such that 
|m| - P, \v\ — q, P < q, and 

Wp+l ■ ■ ■ W„M = Wq+l ■ ■ ■ w„v, (6) 

then 

wi ■ --Win-p = a* a', 
where a' e Pref a, a~ is a prefix ofv of length q - p, and k > - \. 
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Proof. Let u - a\ ■ ■ ■ Up and v - bi ■ ■ - bq with ai,bj e A, i = 1, . . . , p, j - 1, . . .,q. From (|6ll one 
derives: a, = bq-p+i, i - 1, . . .,p, and 

Wp+l ■ ■ ■ Wq(Wq+l ■ ■ ■ W„) = (Wq+l ■ ■ ■ W,n)bl ■ ■ ■ bg^p. 

From a classic result of Lyndon and Schiitzenberger (cf. 11311 ). there exist A,fj. e A* and an integer 
h >Q such that: 

Wp+l ■ ■ ■ W,/ = /lyU, bl--- bq-p = yU/l, w^+i ■ ■ ■ w„ = (7) 

Hence, 



W. 



p^y-W,n^(Apr'A. 

Since w e PAL, one has for any / = 1, . . . ,m, Wi - Wm-i+\ ■ Hence, by taking the reversals of both 
the sides of the preceding equation, one has: 

wi ■ ■ ■ w„-p = w„ ■ ■ ■ = (A~p~f'^^A~ = 

having set A: = /i + 1, a = and a' = Now from a~ - pA - b\ ■ ■ ■ bq-p is a prefix of 

V. 

From (|7]) one has that m - q - h(q - p) + \A\. Since < q — p \i follows that m — q < 

h{q - p) + (q- p) = (h + l)(q - p) = k(q - p). Hence, k > As q - p < { - I and < I, 

the result follows. □ 

Lemma 5.5. Let X be a finite maximal prefix code over a d-letter alphabet. Then 

card(X) - 1 



card((PrefX)\X) = 



d-l 



Proof. The code X is represented by the set of leaves of a full t/-ary tree. The elements of the 
set (Pref X) \ X, i.e., the proper prefixes of the words of X are represented by the internal nodes 
of the tree. As is well known, the number of internal nodes of a full li-ary tree is equal to the 
number of leaves minus 1 divided by li - 1 . □ 

In the following we let Ax be the quantity 

card(X) - 1 



A^ 



d-l 



Proposition 5.6. Let s be an XAR word. There exists an integer e , such that for any non-empty 
proper prefix u of a word ofX, one has u"' i Fact s. Moreover, also (u~y' i Fact s. 

Proof. Any word x € X, as well as any prefix of x, is a factor of s. Let u be any proper non-empty 
prefix of a word of X. From Lemma l?!2l there exists an integer p such that u'' i Fact s. Let e„ be 
the smallest p such that this latter condition is satisfied. Let us set 

e, = max{e, | v e (Pref X) \ (X U {e})). 

We observe that gj is finite since X is a finite code. Therefore, for any u € (Pref X)\ {X (J {s}) 
one has 

u^' i Fact s. 

Since s is closed under reversal it follows that also (m~)''' ^ Fact s. □ 
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Theorem 5.7. Let s - i//x(y), with y - yi ■ ■ - jn - ■ ■ e X'^, yi eX, i > 1, be an X-AR word. There 
exists an integer v such that for all h > v the number S i(h) of right special factors of s of length 
h has the lower bound Ax, i.e., 

Sr(h)>Ax. 

Moreover, any such right special factor of s is of degree d. 

Proof. In the following we shall set for all n, u„ - if/xiyi ■ ■ ■}'«)■ Let { be as in Lemma l54l mo 
be the minimal integer such that ^ - 1 > Cj, and let n be an integer such that \u„\ - m > mo. 
Let us write m„ as m„ = wi ■ ■ ■ Wm with w, e A, / = 1, . . . , m. Since any word x e X is persistent 
in y it follows that u„X c Fact s. Therefore, for any proper prefix m of a word x e X one has that: 
UnU - wi ■ ■ ■ WmU is a right special factor of s of order d and length m + \u\. This implies that 

w\u\+i---w„u (8) 

is a right special factor of length m. However, for u,v e (Pref X) \ X, u v, one cannot have 

W\u\+l ■■■W,nU = W|,,|+i ■■■W„,V. 

This is trivial if \u\ = |v|. If \u\ < |v|, as u„ e PAL, by Lemma l54l one would derive: 

wi ■ ■ ■ w„_|„| = a^a' 

with k > e 5 and a equal to the reversal of a proper prefix of a word of X, which is absurd in view 
of Proposition l5.6l Thus one has that all the words of (O with u e (Pref X) \ X, are right special 
factors of s of length m and order d. Since by Lemma 1531 the number of proper prefixes of the 
words of X is Ax it follows that the number S r{m) of right special factors of length m has the 
lower bound S rim) > Ax. Thus we have proved the result for all m = \u„\ > mo. 

Let us now take h such that m < h < m' - |m„+i|. We can write u„+i = (wi ■ ■ - Wm for some 
word Since for any u e (Pref X) \ X, m„+im is a right special factor of s of length m' + \u\ and 
order d, so is its suffix of length h. We wish to prove that all such suffixes of length h, for different 
values of u in (Pref X) \ X, are distinct. Indeed, if two such suffixes were equal, for instance the 
ones corresponding to u, v € (Pref X) \ X, then their suffixes of length m would be equal, i.e., 

W|„|+l ■ ■ ■ W„U = W|,,|+l ■ ■ ■ w„v, 

which is absurd as shown above. Hence, S rih) > Ax- □ 

Corollary 5.8. Let s be an XAR word. There exists an integer v such that the factor complexity 
Ps of s has for alln > v the linear lower bound 

(card(X) - l)n + c, with c e Z. 

Proof. From the preceding theorem for all n > v, s has at least Ax right special factors of length 
n and order d. Therefore, in view of ([TJ, we can write for all « > v 

pM > pAv) + (n - v)Ax(d - 1) = ps(v) + (n - v)(card(X) - 1) 

= (card(X) - l)n + c, 



having set c - Ps(v) - v(card(X) - 1). 



□ 
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We shall prove that the factor complexity of an X-AR word s is linearly upper bounded 
(cf. Theorem l5.15l ). We need some preparatory results and a theorem (cf. Theorem l5.13l ) which 
is a suitable extension of a formula of Justin jlSll to generalized palindromization maps. 

We recall that a positive integer p is a period of the word w = ai • • • a„, a, G A, 1 < i < n 
if the following condition is satisfied: if / and j are any integers such that 1 < i,j < n and ; = j 
(mod p), then a, = Uj. We shall denote by n(w) the minimal period of w. 

Let X be a finite prefix code and {x be the maximal length of the words of X. We say that 
i/'x(xi ■ ■ • Xm) with X, e X, i > 1, is full if it satisfies the three following conditions: 

Fl. For any x e X there exists at least one integer j such that 1 < j < m and xj = x. 
F2. 7T{if/x(xi ■ ■ ■ xJ) > £x. 

F3. Forallx e X the longest palindromic prefix of ■■• x,„) followed by is (/'x(-'^i ■ ■ ■ Xr^-\), 
where is the greatest integer such that I < < m and Xr^ = x. 

Proposition 5.9. Let X be a finite prefix code, z e X^, and y & X. Ifipxiz) is full, then ij/xizy) is 
full. 

Proof. It is clear that tl'x(zy) satisfies property Fl. Moreover, one has also that 7T(tfrx{zy)) > tx- 
Indeed, otherwise since \l/x{z) is a prefix of if/xizy), one would derive that il/xiz) has a period, and 
then the minimal period, less than (x, which is a contradiction. 

Let us first prove that ijjxiz) - P, where P is the longest proper palindromic prefix of ifrxizy). 
Indeed, we can write: 

ipx(zy) = (l/x(z)yA = Pfi, 

with A,p e A* and ^ + s. One has that |f | > and, moreover, |P| < \\l/x{z)y\. This last 

inequality follows from the minimality of the length of palindromic closure. Let us then suppose 
that: 

p = ii,x{z)y' = (y)>z(z), 

with y' < y. From the Lyndon and Schiitzenberger theorem there exist a,[i e A* and n e N such 
that (y')~ = o:/3,y' = /3a, and i/'x(z) = {a/3)"a. Since if>x{z) is full, from property Fl one has 
that ^ fx, so that n > and n{\j/x{z)) < \a/5\ - \y'\ < £x which is a contradiction. Thus 

P = «Ax(z). 

From the preceding result one derives that the longest palindromic prefix of i//xizy) followed 
by y is tf/xiz). Now let x y and let Q be the longest palindromic prefix of i//xizy) followed by x. 
We can write: 

^xizy) = t//x(z)yA = Qx6, 

with 6 e A*. From the preceding result one has \Q\ < \t//x(z)\. If \Q\ - \4fxiz)\, then, as X is a 
prefix code, one gets x = y,a contradiction. Hence, \Q\ < \ifrx{z)\. We have to consider two cases: 
Case 1. \Qx\ > \tl/x(z)\. This implies 

= Qx' = (xTQ, 

with x' < X. Hence, one would derive (x')~ = uv, x' - vu, and ifix(z) = (uv)"u with u,v e A* and 
n > 0. This gives rise to a contradiction, as n(\l/x{z)) < \uv\ < (x- 

Case 2. \Qx\ < |i^x(-z)l- Let z = xi ■ ■ ■ Xm with Xi € X, 1 < i < m. In this case Q is the longest 
palindromic prefix of t//x{z) followed by x, namely i//x(xi ■ ■ ■ Xr,-i). 

In conclusion, ^xizy) satisfies conditions F1-F3 and is then full. □ 
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Lemma 5.10. Let s be an X-AR word and tffx(z), with z 6 X*, be a prefix of s. There exists 
an integer Vj such that if \4'xiz)\ > v^, then for any prefix u — i//x(zyxi ■ ■ ■ Xk) of s with k > 0, 
y,xi, . . . ,xi; € X, y ^ Xi, 1 < i < k, the longest palindromic prefix of u followed by y is ipxiz). 

Proof. Let us denote by P the longest palindrome such that Py is a prefix of u. We wish to prove 
that for a sufficiently large ij/xiz) one has that P - Let us then suppose by contradiction 

that |P| > \4'x{z)y Setting jcq - y, there exists an integer /, -1 < i <k - \ such that 

\ijJxizxQ ■ ■ ■ Xi)\ < \P\ < \>lJxizxQ ■ ■ ■ Xi^y)\, (9) 

where for ; = -1 the l.h.s. of the preceding equation reduces to \4'x{z)\- Let us prove that for 
-1 < i < k, P + 4'xizxQ ■ ■ ■ Xi). This is trivial for / = -1 and / = A; as |f | < \u\. For < / < A: - 1 
the result is a consequence of the fact that P is followed by y whereas iffxizxo ■ ■ ■ Xj) is followed 
by Xi+i. As X is a prefix code, one would obtain y - Xj+i which is a contradiction. Hence in (|9]l 
the inequalities are strict. If 

\ifrx(zxQ ■ ■ ■ x,)x,+i| < \P\ < \ij/x(zx() ■ ■ ■ 

then one would contradict the definition of palindromic closure. Thus the only possibility is that 
there exists -1 < / < A: - 1 such that 

P = ifrxizxQ ■ ■ ■ xi)p = p~^x{zxQ ■ ■ ■ Xi) 

where p is a proper non-empty prefix of Xj+i. This implies that there exist words A,fi e A* and 
an integer n > such that 

p~ ^ Afi, p ^ fiA, ilfxizxQ ■ ■ ■ Xi) = (Ap)"A. (10) 

Let us set Vj = (gj + l){x, where has been defined in Proposition 15.61 and fx is the maximal 
length of the words of X. Let us suppose that \t//x{z)\ > v,. Since 

(e. + l)£x < < \>Px(zxo ■ --Xi)] < (n + l){x, 

one would derive n> e^ and p" i Fact s which contradicts ( fTOl i and this concludes the proof. □ 

Corollary 5.11. Let s — i^x(xiX2 ■ ■ ■ x„ ■ ■ ■) be an XAR word, with x,- e X, i > 1. There exists an 
integer m > 1 such that for all n > m, 4>x(x\ ■ ■ ■ x„) is full. 

Proof. Since s is an X-AR word, for any x e X there exist infinitely many integers j such that 
X - Xj. We can take the integer m so large that for any x e X there exists at least one integer j 
such that 1 < j < m, xj - x, and, moreover, for each x e X 

\tfrx(xi ■ --Xr^-i)] > Vs. 

This assures, in view of preceding lemma, that for each x e X the longest palindromic prefix 
of if/x{xi ■ ■ ■ x,„) followed by x is if/x(xi ■ ■ ■ Xr^^i). Finally, there exists an integer m such that 
Ji{^x{x\ ■ ■ ■ Xm)) > tx- Indeed, s is w-power free, so that there exists an integer p such that 
for any non-empty factor u of s of length \u\ < tx one has u'' i Fact s. Thus if for all m, 
7i{^x{x\ ■ ■ ■ x,„)) < Ix we reach a contradiction by taking m such that \\lix{x\ ■ ■ ■ Xm)\ > ip + l)^x- 
Hence there exists an integer m such that conditions F1-F3 are all satisfied, so that t//x{xi ■ ■ ■ x^) 
is fuU. By Proposition |5]9] i//xixi ■ ■ ■ jc„) is also full, for all n> m. □ 
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Lemma 5.12. Let z & X* andy e X. Suppose that ijJxiz) has some palindromic prefixes followed 
by y, and let Ay be the longest one. Then 

^x(zy) = iAx(z)A;Vx(z) ■ 

Proof Since A = Ay is the longest palindromic prefix of followed by y, it is also the longest 
palindromic suffix preceded by y~, so that 3'~A3' is the longest palindromic suffix of t//xiz)y- Thus, 
letting t//xiz) = Ay( - ^~y~A for a suitable we obtain 

Mzy) = iMz)yt' = Cy^^yi = Mz)A''Mz) ■ □ 

Let B be a finite alphabet and /i : B — > X be a bijection to a prefix code X c A*. For z e X*, 
we define a morphism (f,:B*—> A* by setting for allb e B 

ip,{b) = iAx(z//W)«Az(z)"' = >Px{zW^ly (11) 

where for the last equality we used Lemma 15.121 

Theorem 5.13. Let s — iJ/x(xiX2 ■ ■ ■ x„ ■ ■ ■) be an XAR word with x,- e X, i > 1. If z — xi ■ ■ ■ x„j 
is such that u,„ — ij/xiz) is full and p, tp, are defined as above, then for any w € B* the following 
holds: 

^x(zp(w)) = (p,(ifr(w))iffx(z) ■ 

Proof. In the following we shall use the readily verified property that if y : B* — > A* is a 
morphism and v is a suffix of m E B*, then y(Mv"') = 'y(u)j{vyK 

We will prove the theorem by induction on \w\. It is trivial that for w - e the claim is true 
since if/(s) = e = tfizi^)- Suppose that for all the words shorter than w, the statement holds. For 
|w| > 0, we set w - vb with b e B, and let y = p(b). 

First we consider the case \v\b + 0. We can then write v - v\bv2 with Ivilfc = 0. Since i/'xfe) 
is full, so is tfrxizpiv)); hence i/zxizpivi)) is the longest palindromic prefix (resp. suffix) followed 
(resp. preceded) by y (resp. y~) in iffx(zp(v)). Therefore, by Lemma l5.12l we have 

^xizpiv)y) = t/,x{zp{v))t/,xizpivi)r't/^xizpiv)) (12) 

and, as i^(vi) is the longest palindromic prefix (resp. suffix) followed (resp. preceded) by b in 

'A(v'), 

i/fivb) = tA(v)(A(vi)"V(v) . (13) 

By induction we have: 

ifrx(zp(v)) = (p,(ifr{v))il/x(z) , iffxizpivi)) = (f,{t//{vi))t//x(z) ■ 

Replacing in (fT2l) . and by ( fTSl ). we obtain 

t/,x{zp{v)y) = ip,iif,{v))ipM(vi)r''Pzmv))Mz) 

= ^,(iA(v)iA(vi)"V(v))'Ax(z) 

= <p,{ifr(vb))iffx(z) , 

which was our aim. 
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Now suppose that \v\i, - 0. As i^x(z) is full, the longest palindromic prefix of ij/x(z) which is 
followed by y is Ay - if/x{xi ■ ■ ■ x^^-i), where r,, is the greatest integer such that 1 < r,. < m and 
x,-^, - y. By Lemma l5.12l we obtain 

t/^x(zti(v)y) = (tAx(z/^(v))y)<+> = ilfx{zfi{v))A;' i/^x(zti(v)) . (14) 

By induction, this implies 

iAx(zy"(v')3') = (PzmvMxiz)A;'<Pzm'Mxiz) . (15) 

From (fTTT ) it follows 

^,(fo) = iAx(z3')('Az(z))"' = >Px(z)A-' . 

Moreover, since i//(v) has no palindromic prefix (resp. suffix) followed (resp. preceded) by y 
one has 

ifr(vb) = il/(v)bi//(v) . (16) 

Thus from ( fTSl l we obtain 

(/'x(z/u(v)y) = (p,{>tj{v))ip,{b)ip:,{i^(v))>f/x{z) 
= ^,((A(v)Z7iA(v))iAx(z) 
= ipmvbMxiz), 

which completes the proof. □ 

Corollary 5.14. Every X-AR word is a morphic image of a standard Arnoux-Rauzy word over 
an alphabet B of the same cardinality as X. 

Proof. Let s - 4ix{x\X2 ■ ■ ■ x„ ■ • ■ ) be an X-AR word with x,- eX,i> 1, and let x,- - p(bi) for all 
/ > 1, where yu : B — » X is a bijection. By the preceding theorem, there exists an integer m > 1 
such that, setting z = xi ■ ■ ■ for all w E B* we have tfrxizpiw)) = (f:(t//(w))ij/xiz)- Hence for all 
A; > OT we have 

if/x(xi ■■■Xk)^ ip,(t^(b,„+i ■ ■ ■ bk))i^x(z) , 
so that taking the limit of both sides as A: — > oo, we get 

s = (Pz(i^(b„+ib,„+2 ■■■bn---)) ■ 

The assertion follows, as each letter of B occurs infinitely often in the word bm+ibm+2 ■ ■ - bn - ■ ■ . 

□ 

Example 5.1. Let X - {aa, ab, b}, B - {a, b, c], and /i : Z? — > X be defined by fi(a) - ab, p(b) - b, 
and jj(c) = aa. Let s be the X-AR word 

s — tlixiiabbaaf) — ababaaababaababaaabababaaababaababaaaba ■ ■ ■ . 

Setting z - abbaa, it is easy to verify that the prefix ij/xiz) - ababaaababa of s is full, so that 
s = it//iiabc)'^)), where (pzia) — ababaaababa, tpzib) — ababaaab, and ^z(c) = ababaa. 

Let s - if^xixi • • • jc„ • • • ) be an X-AR word with x,- e X, i > 1, and let mo be the minimal 
integer such that - ^x{x\ ■ ■ ■ x,„^) is full. For all y > we shall set 

aj = Mm„+y and nj = \aj\. 
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Theorem 5.15. Let s be an X-AR word. Then the factor complexity of s is linearly upper 
bounded. More precisely for all n > no 

Ps(n) < 2 card(X)n - card(X). 

Proof We shall first prove that for all j >Q 

pArij) < caxA{X)nj - card(X). (17) 

Let be a bijection of an alphabet B and X. We set Zj - xi - ■ ■ Xm^+j and consider the morphism 
If,. : B* ^ A* defined, in view of ( fTTT l. for all e B as: 

ip,Jb) = aJ^-^l^, 

where aj - itxizj) and A^(i) is the longest palindrome such that Af^(h)f^{b) (resp. (fj.(b))~ A/j^i,)) is 
a prefix (resp. suffix) of a j. 

Since s is uniformly recurrent, there exists an integer p such that all factors of s of length n j 
are factors of a j+p. Hence, there exist p letters b], . . . ,bp e B such that aj+p = ipxizjf^ibi) ■ ■ ■ f^(bp)). 
By Theorem IS .131 one has 

>^x(Zjp(bi) ■ ■ -pibp)) = ajA-l^^ajA-l^_^ ■ ■ ■ ajA'^^^^^aj. 

Thus aj covers aj+p and the overlaps between two consecutive occurrences of aj in aj+p are 
given by A^(i;), 1 < / < p. Any factor of s of length nj will be a factor of two consecutive 
overlapping occurrences of aj, i.e., of 

o^A^i)"], i^l,...,p. (18) 

For any 1 < i < p the number of distinct factors in (fTSl ) is at most nj - |A^(i )| < nj - 1. Since 
fi{B) = X and the number of distinct consecutive overlapping occurrences of aj in aj+p is at most 
card(X), equation ( fTTb is readily derived. 

Now let n be any integer n > no such that n + n^ for all k > 0. There exists an integer 
j such that nj < n < nj+i. Since s is not periodic, by a classic result of Morse and Hedlund 
(see I12I Theorem 1.3.13]) the factor complexity ps is strictly increasing with n. Moreover, as 
rij+i <2nj < 2n, one has by ( fTTI ): 

Psin) < Psi'ij+i) < caid{X)nj+i - card(X) < 2card(X)n - card(X), 

which concludes the proof. □ 



6. Conservative maps 

Let A be an alphabet of cardinality d > I and let X be a code over A. We say that the 
pahndromization map i^x is conservative if 

>/fx(X*)QX*. (19) 

When X - A, the pahndromization map i/r is trivially always conservative. In the general case 
ifrx may be non conservative. 
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Example 6.1. Let X - {ab, ba). One has i^x(ab) - aba i X*, so that i//x is not conservative. In 
the case Y = {aa,bb} one easily verifies that tf/yiY*) c Y*. If Z - [a,ab] one has that for any 
word w € Z*, i/'zCw) e aA* \ A*bbA*, with A = {a, b], so that it can be uniquely factorized by the 
elements of Z. This implies that tfrz is conservative. 

The following result shows that a prefix code having a conservative palindromization map 
allows a natural generalization of properties P2 and P3 of Proposition l2.1l in addition to the ones 
for PI and P4 shown in Propositions I3.1l and l3.2l 

Proposition 6.1. Let X be a prefix code such that i//x is conservative, and p,w & X* with p a 
prefix ofij/x{w). The following hold: 

1. p*-^' is a prefix ofipxiw) and p*-^' 6 X* . 

2. If p is a palindrome, then p — [l/x{u)for some prefix u e X* ofw. 

Proof. Let w = xiX2 - ■ ■ x^ with Xj e X for 1 < / < A;, and let v be the longest prefix of w in X* 
such that ipxiv) is a prefix of p; we can write v = xi ■ ■ ■ x„ or set n = if v = s. Thus p = t//xiv)( 
with ( e A*. Since ifrx is conservative one has if/x{v) e X*. Moreover, as X is a prefix code, X* 
is right unitary, so that one has ( e X*. If ^ = e, then p = t//xiv) = /J^^' and there is nothing to 
prove. Let us then suppose [, + s. Since i/'x(v)x„+i, as well as p, is a prefix of and X is a 

prefix code, one has that ^ E x„^\X* . Thus is a prefix of p. 

From the definition of palindromic closure it follows that |(iAx(v)x„+i)^^'| < \p'-^\ By the 
maximality of n, we also obtain that p is a (proper) prefix of (i/'x(v)x„+i)*^* = ^x(m ■ ■ ■ so 
that 1/9*^*1 < \{ilix(y)^n+\f^\ Thus |/?*^*| = |(iAx(v)-Xn+i)*^*l- Since p*^* is a palindrome of minimal 
length having i/'x(v)jc„+i as a prefix, from the uniqueness of palindromic closure it follows that 
= \iix(yXn+\)- Hence, p*-^' is a prefix of lA^xCw), and p^^' e X* as i^x is conservative. 

If is a palindrome and ^x(y), then the argument above shows that p^^^ - P - ^x{vx„+i), 
which is absurd by the maximality of n. □ 

The following proposition gives a sufficient condition which assures that iffx is conservative. 

Proposition 6.2. Let X C PAL be an infix and weakly overlap-free code. Then ipx is conserva- 
tive. 

Proof. We shall prove that for all n > one has that iJ/xiX") c X*. The proof is by induction on 
the integer n. The base of the induction is true. Indeed the case n = is trivial and for « = 1, 
since X c PAL, one has tfrxiX) = X. Let us then suppose the result true up to n and prove it 
for n + 1. Let w e. X" and x e X. By induction we can write ilfxiw) - x[ - ■ ■ x'^, with x,' e X, 
1 < i < m. Thus: 

Mwx) = = (x'l ■ ■ ■ x'„xY^K (20) 

Let Q denote the longest pahndromic suffix of x'.^ - ■ ■ x'^x. Since x 6 PAL we have \Q\ > \x\. We 
have to consider two cases: 

Case 1. 121 = \x\. From ^ and X c PAL, it follows: 
Thus ^i'x(>vJi:) e X* and in this case we are done. 



Case 2. \Q\ > \x\. One has: 
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Since \Q\ > \x\ and x,Q e PAL, there exists I < j <m such that x'j - A/u, A, fie A* and 

f^^'j+l ---X'^X^ XT], 

with T] e A*. We shall prove that A = s. Indeed, suppose that A ^ s. We have to consider the 
following subcases: 

1) |x| < |yu|. This implies that x is a proper factor of x'j which is a contradiction, since X is an infix 
code. 

2) |x| > l/ix^.^jl. In this case one has that x^.^j is a factor of x which is a contradiction. 

3) |//| < |x| < Ijt/jc^^J. This implies that x = fip, where p is a proper prefix of x'j^^. Since ju is a 
proper suffix of x'j we reach a contradiction with the hypothesis that X is weakly overlap-free. 

Hence, A = s and n - x - x'y Therefore, one has, as X c PAL, 

iAx(h'x) = {x\ ■ ■ ■ x'^x)^^'^ = -^'r ■ ■ x'j-\xx'j^\ ■ ■ ■ x'„xx'j_^ ■■■x\ eX*, 
which concludes the proof. □ 

Example 6.2. Let X = {bab, bcb). One has that X c PAL. Moreover, X is an infix and weakly 
overlap-free code. From the preceding proposition one has that i//x is conservative. 

Let us observe that Proposition 16.21 can be proved by replacing the requirement X c PAL 
with the two conditions: X = X~ and il>x{X) Q X*. However, the following lemma shows that if 
the code X is prefix these two latter conditions are equivalent to X c PAL. 

Lemma 6.3. Let X be a prefix code. Then one has: 

X c PAL X = X~ and i//xiX) c X*. 

Proof. If X c PAL, then trivially X - X~. Moreover, for any x e X one has ifrxix) = x*^^ - x e 
X*. Let us prove the converse. Suppose that x e X is not a palindrome. We can write x - AQ, 
where Q = LPS (x) is the longest palindromic suffix of x and A b. One has, by hypothesis: 

il,^{x) = = AQA~ = xA~ e X* . 

Since X is a prefix code, from the right unitarity of X* one has A~ e X* . AsX - X" it follows 
A e X*. Since x = AQ and X is a prefix code, one derives A - x and Q - e which is absurd as 
\Q\ > 0. □ 

Proposition 6.4. Let X c PAL be a prefix code. Then: 

tffx is conservative <=> /or all x 6 X, LPS (>J/x(X*)x) C X* . 

Proof. Let w 6 X*. If w = e, since X c PAL, one has LPS{x) - x e X. Suppose w e, so 
that w - xi ■ ■ ■ x„, with Xj e X, i = 1, . . . , n. Let x e X and Q be the longest palindromic suffix of 
il/x{x\ ■ ■ ■ Xn)x. We can write: i//xixi ■ ■ ■ x„)x - 6Q with 6 e A* and 

>l'x{x\ ■ ■ ■ Xnx) = {^x{x\ ■ ■ ■ x„)x)*+^ = 6Q6~ = ipxixi ■ ■ ■ x„)x6~. 
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Since ipx is conservative, one has i/zxixi ■ ■ ■ x„), ifrx(x\ ■ ■ ■ x„x) e X*, so that as X is a prefix code 
from the preceding equation one derives 6~ e X* and then 6 e X* because X c PAL. Finally, 
from the equation t//x{xi ■ ■ ■ x„)x - 6Q it follows Q e X* as X is a prefix code. 

(<=) We shall prove that for all n > one has iffxiX") £ X*. The result is trivial if n = 0. For 
n - I one has that for any x e X, ij/xix) - x*^^' - x as X c PAL, so that t//x(X) c X*. Let us now 
by induction suppose that ij/xiX") c X* and prove that £ X* . Let x\, . . . ,Xn,x e X and 

let Q denote the longest palindromic suffix of i/'xC-^i ■ ■ ■ x,^x, so that 

^x{xi ■ ■ ■ x„)x = 6Q, 

with 6 e A*. The code X is bifix because X is a prefix code and X c PAL. Since by hypothesis 
Q,t//x(xi ■ ■ ■ x„) e X*, from the preceding equation and the left unitarity of X*, one gets 6 e X*. 
Moreover, 6~ e X* since X c PAL. Hence, one has: 

i^xixi ■ ■ ■ x„x) = (\l/x{x] ■ ■ ■ x„)x)'-*'' = 6Q6~ e X*, 

which concludes the proof. □ 

Let X be a code over the alphabet B and (p : A* ^ B* an injective morphism such that 
(f{A) = X. We say that tpx is morphic-conservative if for any w e A* one has 

ip^w)) = iffxiipiyv)). (21) 

Example 6.3. Let A - {a,b}, B - {a,b,c}, X - {c,bab], and if : A* ^ B* be the injective 
morphism defined by (f(a) - c and ip(b) - bob. Let w = abaa; one has (ACw) - abaabaaba, 
ipiw) — cbabcc, and 

ip(if/{w)) — cbabccbabccbabc — i/zxi'piw)). 
As a consequence of Corollarv l6.11l one can prove that t//x is morphic-conservative. 
Lemma 6.5. Ifi//x is morphic-conservative, then it is conservative. 

Proof. Let u e X* . The result is trivial if m = e. If m is not empty let us write u - xi - ■ ■ x„, with 
Xi e X, i - I, . . . ,n. Since if is injective, let a, 6 A be the unique letter such that jc, - (f{ai). 
Therefore, u - ip{a\ ■ ■ - a,,). By (|2T]) one has i/'x(m) - ip{4>{ci\ ■ ■ £ X*, which proves the 
assertion. □ 

The converse of the preceding lemma is not true in general. Indeed, from the following 
proposition, one has that if i/'x is morphic-conservative, then the words of X have to be palin- 
dromes. However, as we have seen in Example 16. II there are ^x which are conservative with a 
code X whose words are not palindromes. 

Proposition 6.6. If>l'x is morphic-conservative, then X c PAL and X has to be a bifix code. 

Proof. Let a be any letter of A and set x = f{a). One has from jTli that (p{i//(a)) = if {a) - 
X - ipxi'Pia)) - ipx{x) = x^'^K Hence, x = x*^' E PAL, so that all the words of X have to be 
palindromes. 

Let us now prove that X is a suffix code. Indeed, suppose by contradiction that there exist 
words x,y e X such that y - Ax with A e A^ . Let a,b e A be letters such that (p(a) - x and 
f{b) = y. For w - ba one has: 

fiipibaj) - if(bab) - yxy. 
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and, recalling that y e PAL, 



Since e PAL, the longest palindromic suffix Q of /Ixx has a length \Q\ > 2\x\. Thus 



\{AxxV>\ < \AxxX-\ = 2|3;| < \yxy\. 



which is absurd. Hence, X has to be a suffix code and then bifix as X c PAL. 



□ 



Remark 6.7. As a consequence of Lemma 16.51 and Proposition 16.61 every code X having a 
morphic-conservative ij/x satisfies the hypotheses of Propositions 13.11 13.21 14.11 and 16.11 so that 
all properties P1-P4 in Proposition 12.11 admit suitable generalizations for ^x- Let us highlight in 
particular the following: 

Proposition 6.8. If>l'x is morphic-conservative, then it is injective. 

Proof. From Proposition l6.6l the code X has to be bifix, so that the result follows from Proposi- 
tion O □ 

Let us observe that in the preceding proposition one cannot replace morphic-conservative 
with conservative. Indeed, for instance, if X = {a, ab] then ifrx is conservative (see Example l6.1l l 
but it is not injective, since if/xiaba) = ij/xiabab). 

The following theorem relates the two notions of conservative and morphic-conservative 
palindromization map. 

Theorem 6.9. The map tf/x is morphic-conservative if and only ifX c PAL, X is prefix, and tffx 
is conservative. 

For the proof of the preceding theorem we need the following 

Lemma 6.10. Let tp : A* ^ B* be an injective morphism and (p{A) — X c PAL. For any w e A*, 

ip(w~) — (ifi(w))~. Thus for any w € A*, (p(w) — ((p(w))~ if and only ifw e PAL. 

Proof. The result is trivial if w = s. Let us suppose w s and write w as w = ai ■ ■ - an with 
a, e A, 1 < i < n. One has 

(fiiw) = (^(fli) ■ ■ ■ (p(a„) ^ Xi---Xn, 

having set x, = tpiaj) e X, 1 < i < n. Since X c PAL, one derives 



((p(w))~ = x„ ■ ■ ■ 



xi = tp{w~). 



As ip is injective one obtains: 



(p(w) 



(ip(w))~ - (p(w~) if and only if w — w~. 



which concludes the proof. 



□ 
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Proof of Theorem \6.9\ (=>) Immediate from Proposition l6.6l and Lemma l63] 

Let 1/3 : A* — > B* be an injective morphism such that (f{A) = X is a prefix code and X c PAL. 
We wish to prove that for any w e A* one has; 

The proof is by induction on the length n of w. The result is trivial if « = 0. If n = 1, i.e., 

w = fl 6 A, one has, as ifia) e PAL, 

(p(ifr{a)) = (p(a) = i/rx((^(fl)). 

Let us then suppose the result true up to the length n and prove it for n + 1 . We can write, by 
using the induction hypothesis and the fact that ip(w) € X*, 

Let z - iffiw); we need to show that (^(z)(^(a))^^' = ipi^iwa)). As ij/x is conservative, by Proposi- 
tion 16.41 the longest palindromic suffix Q of il/x{if{wy)(f{d) - ip{z}f{a) belongs to X* . Since ifia) 
is a palindrome and X is a suffix code, there exists a suffix v of z such that Q - (p(v)(p(a). Using 
Lemma l6.10l one derives that va is the longest palindromic suffix of za, so that, letting z = uv, 

(i^(z)^(a))*^' - (p(uvau~) - ^((za)*^') = (f(iff(wa)), 

which concludes the proof. □ 

Corollary 6.11. Let X be a weakly overlap-free and infix code such that X c PAL. Then tpx is 
morphic-conservative. 

Proof. Trivial bv Proposition l6.2l and Theorem |6.9l □ 

Remark 6.12. The hypotheses in the previous corollary that X is a weakly overlap-free and 
infix code are not necessary in order that ^x is morphic-conservative. For instance, let X be the 
prefix code X - {aa, cbaabc). One has that bcX* n PAL = 0. From this one easily verifies that 
for all « > 0, if i//x(X") c X*, then for x e X, LPS{\px{X")x) c X*. Thus by using the same 
argument as in the sufficiency of Proposition l6.4l one has that ^xi^"^^) C ^* ■ It follows that ijjx 
is conservative and then morphic-conservative by Theorem l6.9l 

Let \jjx be a morphic-conservative palindromization map and if : A* ^ B* the injective 
morphism such that X - ipiA) and (f o ijj - ilix o ip. Since X has to be bifix, (f can be extended to 
a bijection (.p : A" ^ X". The extension of if/x to X" is such that for any x e X'^ 

ilJx(x) = ipiilj{ip-'{x))). 

For any x e X'^ the word is an epistandard word over A, so that 

i//xiX") = ip{EpistandA). 

Therefore, one has: 

Proposition 6.13. The infinite words generated by morphic-conservative generalized palin- 
dromization maps are images by injective morphisms of the epistandard words. 



6 CONSERVATIVE MAPS 



25 



Let us now consider the case when X is a finite and maximal prefix code. 

Lemma 6.14. IfX is a finite and maximal prefix code over A such that X c PAL, then X — A. 

Proof. Let ix be the maximal length of the words of X. Since X is represented by a full d-ary 
tree, there exist d distinct words pa e X, with p a fixed word of A*, a e A, and \pa\ - (x- As for 
any a e A, the word pa e PAL, the only possibility is = e, so that X - A. □ 

Proposition 6.15. Let X be a finite and maximal prefix code over A. Then ij/x is morphic- 
conservative if and only ifX — A. 

Proof The proof is an immediate consequence of Theorem |6.9| and Lemma 16.141 □ 

In the case of a finite maximal prefix code the map ilix can be non conservative. For instance, 
if X - {a, ba, bb], then if)x(ba) = bab i X* . The situation can be quite different if one refers to 
infinite words over X. Let us give the following definition. 

Let X be a code having a finite deciphering delay. We say that ij/x is weakly conservative if 
for any t € X", one has 4'xif) e X"; in other terms the map ^x ■ X'^ A'^ can be reduced to a 
map if/x '■ X" X". In general, tfrx is not weakly conservative. For instance, if X = {ab, ba] and 
t e ababX'^, then i^^CO i X'^- 

Trivially, if iffx is conservative, then it is also weakly conservative. However, the converse is 
not in general true as shown by the following: 

Theorem 6.16. IfX is a finite and maximal prefix code, then ipx is weakly conservative. 

Proof. Let s - tffx{t) where X a finite and maximal prefix code and f e X"^. We recall l'9'l that 
any maximal prefix code is right complete, i.e., for any / e A*, one has fA* n X* + %. \fX is 
finite, then for any / e A* and any letter a e A one has: 

/fl* e X\ 

for a suitable integer k, depending on / and on a, such that Q < k < {, where ^ = is the 
maximal length of the words of X. Let a be a fixed letter of A. We can write: 

i[„]a*" e X\ 

with < kn <{. Setting p = [fJ. we can write: 

S[„] = X1X2 ■ --Xq^A, 

with Xi e X,i - 1, . . .,q„, q,, > p and \A\ < {. Now S[„] < so that since X is a prefix code, 

one has: 

S[n+(] - X\X2 ■ ■ ■ Xq^^^A' , 

with qn+e > qn, Xi eX,i - q„+u . . . , q„+(, and \A'\ < £. Since 

lim x\-- ■ x„ E X" 

and lim„^oo xi - ■ ■ jic^„ - lim„^oo the result follows. □ 

Corollary 6.17. Let s — ij/xit) with t e X'^ be an X-AR word. Then s is the morphic image by an 
injective morphism of a word w e B'^, where B is an alphabet of the same cardinality as X. 
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Proof. By the preceding theorem, since ifrx is weakly conservative, we can write: 

with X, e X, / > 1 . Let B be an alphabet having the same cardinality of X and tp : B* —> X* he the 
injective morphism induced by an arbitrary bijection of B and X. If ip^^ is the inverse morphism 
of (fi one has: 

(fi'Us) = 9?"'(xi)9j"'(x2)---9?"'(jc„)--- . 

Setting (p^\xi) = w, e B for all / > 1, one has (p^^(s) = wiW2 • ■ ■ w„ ■ • • = w € B'^ and i = 
(p(w). □ 

Let us observe that in general the word w e is not episturmian as shown by the following: 
Example 6.4. Let X = {a, ba, bb) and s - il/x{{ababby). One has: 

s — ababbabaababbabababbabaababbaba ■ ■ ■ . 

Let B = {0, 1, 2) and tp the morphism of B* in X* defined by the bijection ^(0) = a, = ba, 
and (fi(2) - bb. One has: 

w = <^-i(.?) = 0120101201120101201 ■■■ , 

and the word w is not episturmian (indeed, for instance, the factor 01201 is not rich in palin- 
dromes). 

7. The pseudo-palindromization map 

An involutory antimorphism of A* is any antimorphism : A* — > A* such that o i? = id. 
The simplest example is the reversal operator R : A* — > A* mapping each w e A* to its reversal 
w~. Any involutory antimorphism )? satisfies & = toR = Rot for some morphism r : A* ^ A* 
extending an involution of A. Conversely, if t is such a morphism, then ^-ToR-Rorisan 
involutory antimorphism of A*. 

Let § be an involutory antimorphism of A*. For any w E A* we shall denote ??(w) simply 
by M>. We call ■^-palindrome any fixed point of )?, i.e., any word w such that w - w, and let 
PALi^ denote the set of all i?-palindromes. We observe that s e PAL^^ by definition, and that 
/^-palindromes are exactly the usual palindromes. If one makes no reference to the antimorphism 
1?, a i?-palindrome is called a pseudo-palindrome. 

For any w € A*, w®'', or simply w®, denotes the shortest )?-palindrome having w as a prefix. 
If Q is the longest )?-palindromic suffix of w and w - sQ, then 

w® = sQs. 

Example 7.1. Let A - [a, b, c} and be defined as a = i>, c = c. If w = abacabc, then = cai>c 
and w® = abacabcbab. 

We can define the §-palindromization map : A* PAL§ by ^!',y(e) = e and 

tffif(ua) - {^ji{u)af 

for M € A* and a € A. 

The following proposition extends to the case of ??-palindromization map i/',^ the properties 
of palindromization map ^ of Proposition 12 . 1 1 (cf . . for instance, [SJ): 
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Proposition 7.1. The map tff,f over A* satisfies the following properties: for m, v e A* 
PI. If u is a prefix ofv, then ijf§{u) is a ^-palindromic prefix (and suffix) oftff,'}(v). 
P2. If p is a prefix q/i^,j(v), then p® is a prefix ofi]/§{v). 

P3. Every ^-palindromic prefix ofi]/§{v) is of the form i]/§{u)for some prefix u ofv. 

P4. The map i]/§ is injective. 

The map t//ff can be extended to infinite words as follows: let jc - xiX2 ■ ■ ■ x„ ■ ■ ■ e A^ with 
Xi e A for / > 1. Since for all n, tfr-ff(x[„]) is a prefix of iff-s{x[„+i]), we can define the infinite word 
ifi,f(x) as: 

if/,f(x) = Hm ij/,t(x[„ii . 

The infinite word x is called the directive word of ^§{x), and s - t//§(x) the ■d--standard word 
directed by x. If one does not make reference to the antimorphism § a i?-standard word is also 
called pseudostandard word. 

The class of pseudostandard words was introduced in [5J. Some interesting results about 
such words are also in 171- In particular, we mention the noteworthy result that any pseudo- 
standard word can be obtained, by a suitable morphism, from a standard episturmian word. 

More precisely let iu§ be the endomorphism of A* defined for any letter a e A as: yU,?(fl) = a®, 
so that fi^{a) = a if a - d and fj.{)(a) - ad, if a a. We observe that fi^ is injective since jU,y(A) 
is a prefix code. The following theorem, proved in (3], relates the maps if/§ and i// through the 
morphism yu^. 

Theorem 7.2. For any w e A°°, one has ij/i)(w) - ^§(i]/(w)). 

An important consequence is that any i?-standard word is a morphic image of an epistandard 
word. 

A generalization of the pseudo-palindromization map, similar to that given in Section [3] for 
the palindromization map, is the following. Let d- be an involutory antimorphism of A* and X a 
code over A. We define a map: 

: X* ^ PALif, 

inductively as: t//§^x(s) = e and for any w e X* and x e X, 

^»^{wx) = (i/'i^,x(w)x)®. 

\f - R, then ^rx - 4'x- If ^ = A then i/r^^ - The map ijj^x will be called the -d-- 
palindromization map relative to the code X. 

Example 7.2. Let A - {a, b, c] and § be defined as a = and c - c. Let X be the code 
X — [ab, ba, c] and w — abcba. One has: )l/§ xi.o.b) — ab, i]/§xiabc) — abcab and i//§x(abcba) — 
abcabbaabcab. 

Let us now consider a code X having a finite deciphering delay. One can extend t//§x to X'^ 
as follows: let x = xiX2 ■ ■ ■ x„ ■ ■ ■ , with x, e X,i > I. For any n > 1, ij/,f^x{xi ■ ■ ■ x„) is a proper 
prefix of 4'».x{x\ ■ ■ ■ x„x„+i) so that there exists 

Um ip,ix(xi ■ ■ ■ x„) = i^,ix(x). 
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Let us observe that the word ip^^xix) has infinitely many ??-palindromic prefixes. This impHes 
that if/^^xix) is closed under §, i.e., if w e Factil)ff x{x), then also w e Factilr^^xix). 

We remark that the maps i//§^x and their extensions to X", when X is a code with finite 
deciphering delay, are not in general injective. The following proposition, extending Propositions 
I3.2l and l4n can be proved in a similar way. 

Proposition 7.3. Let X be a prefix code over A. Then the map ij/ji^x ■ PAL{^ and its 

extension to X'^ are injective. 

Several concepts, such as conservative and morphic-conservative maps, and results consid- 
ered in the previous sections for the map i/^x can be naturally extended to the case of the map 
^»,x- We limit ourselves only to proving the following interesting theorem relating the maps i/r^ 
and il/^^x where X - n§iA). Combining this result with Theorem l7.2l one will obtain that i/^^x is 
morphic-conservative. 

Theorem 7.4. Let A be an alphabet, § an involutory antimorphism, and X — /JaiA). Then for 
any w e A°° one has: 

if^dw) = iA.?,x(/"i>(w)). 

Proof. It is sufficient to prove that the above formula is satisfied for any w € A*. The proof is 
obtained by making induction on the length of w. 

Let us first prove the base of the induction. The result is trivially true if w = e. Let w = a e A. 
If a — a, then a € X and tffft(a) — a — tffff x(fJ-Mo)) — If a a, one has fJ.§(a) — aa € X 

and tffff(a) = aa - il/^^xi^dfl)) - 4'»,x(aa). 

Let us now prove the induction step. For w e A* and a e A we can write, by using the 
induction hypothesis, 

i/r^(wa) = (i/r^(w)fl)® = (i/',:>,x(A',?(w))fl)®. (22) 

Moreover, one has: 

ilr^f^xiMdwa)) = ^,y,x(jU,,(w)fl®) = (i/r,,,x(ju,y(w))fl®)® = (^,,(w)fl®)®. (23) 

We have to consider two cases. If a - a, then a® = a, so that from the preceding formulas (l22l l 
and (|23] ) we obtain the result. 

Let us then consider the case a + a. We shall prove that {t//§{w)a)'^ - ifr,<t{wa) has the prefix 
p = iff{t(w)ad, so that from property P2 of Proposition 17. 1 1 one will have /?® < t//§(wa). Since 
ifr,<t{w)a < p, one will derive that \iff,t{wa)\ - \{t//§(w)a)'^\ < |/7®| so that = (t//§(w)a)'^ from 
which the result will follow. We have to consider two cases: 

Case 1. i//^(w) has not a j?-palindromic suffix preceded by the letter a. Thus 

(i/',?(w)a)® = ifr,f(w)adt^/f(w), 

so that in this case we are done. 

Case 2. i/'^(w) has a )?-palindromic suffix u of maximal length preceded by the letter a. Since u 
is also a )?-palindromic prefix of i/'^(w), by property P3 of Proposition l7. 1 I there exists v prefix of 
w such that u = i^^(v). Since du is a suffix of t//§(w) one has that ua = t//ff(v)a is a prefix of t//§(w). 
By property P2 of Proposition l7.1l (i^^(v)fl)® is a prefix of iff{t{w). 

Since |v| < |w| one has \va\ < \w\. By using two times the inductive hypothesis one has: 

(i/'f>(v)a)® = i^,f(va) = tj/^^x(f^^(v)ad) = (\f/{fj{p.»{v))adf = (lAd(v)aa)®. 
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Hence, i//§(w) has the prefix uaa and the suffix aau, so that if/^iw) - Aaau with A* and 

{i]/§{w)a)® — AaauaaA — i///t(w)aaA, 

from which the result follows. □ 

From Theorems 17. 2l and l7.4l one derives the noteworthy: 

Corollary 7.5. Let A be an alphabet, & an involutory antimorphism, and X — ^1§{A). Then one 
has: 

Example 7.3. Let A = {a, b], § be defined asa-b, and X - §(A) = {ab, ba}. Let w = aab. One 
has (//(aab) — aabaa, ip^^aab) — ababbaabab — jjL^{aabaa). Moreover, jigiaab) — ababba and 
i^§^x(ababba) - ababbaabab. 
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